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"MOLECULAR CLONINQ AND CHARACTERIZATION OF A FURTHER GENE SEQUENCE CODING FOR 

HUMAN RELAXIN" 

This invention relates to the nr^olecular cloning and characterization of a gene sequence coding for 
hunnan relaxin. The Invention is also concerned with recombinant DNA techniques for the preparation of 
human relaxin. prorelaxin and preprorelaxin. 

In our Australian Patent Application No. 17906/83 (PF 5362/82. filed 12th August. 1982), we described 
5 the molecular cloning and characterization of a gene sequence coding for human relaxin. We have now 
found a second gene which also codes for human relaxin. 

More specifically, this invention relates to an isolated and purified (i.e.. "cloned") human gene coding 
for prorelaxin, preprorelaxin, and the A and/or B and/or C peptide chains of human relaxin, methods for 
isolating and purifying the genes and a method for transferring the genes to and replicating the genes in a 
10 host cell. The cloned genes are expressed by the host cell when fused with a host-expressable procaryotic 
or eucaryotic gene. The genes are thus useful in the production of human relaxin for therapeutic purposes. 

The invention also relation to the peptides human reiaxin. prorelaxin and preprorelaxin, to the individual 
peptide chains which comprise these sequences and to modified forms of these peptides. 

The invention further relates to modified genes coding for the individual relaxin chains and for the. 
;5 above-mentioned modified forms. 

[Note: References used in the following description are collected at the end of the description.] 
Pioneering work by Hisaw (1926) suggested an Important role for the peptide hormone relaxin in 
mammals through its effects in dilating the pubic symphysis, thus facilitating the birth process. Relaxin is 
synthesized and stored in the corpora lutea of ovaries during pregnancy and is released into the blood 
20 stream prior to parturition. The availability of ovaries has enabled the isolation and amino acid sequence 
determination of relaxin from pig (James et al, 1977; Schwabe et al. 1977) rat (John et al, 1981) and shark 
(Schwabe et al. 1982). The biologically active hormone consists of two peptide chains (known as the A and 
6 chains) field together by disulphide bonds, two inter-chain and one intra^chain. The structure thus closely 
resembles insulin in the disposition of disulphide bonds which has led to speculation of a common ancestral 
25 gene for these hormones (James et al. 1977; Schwabe et al, 1977). 

Recombinant DNA techniques have been applied to the isolation of cDNA clones for both rat and 
porcine relaxins (Hudson et al, 1981; Haley et al. 1982), see also Australian Patent Application No. 11834/83 
(PF 2696/82). Synthetic undecamer nucleotides, prepared on the basis of amino acid sequence information, 
were used as primers for the synthesis of cDNA probes greatly enriched in relaxin cDNA sequences which 
30 identified relaxin cDNA clones in libraries derived from both rat and porcine ovarian tissue. The relaxin 
structural gene was found to code in both cases for a single chain precursor which resembles preproinsuiin 
in the overall configuration. i.e., signal peptide/6 chain/C peptide/A chain. 

In our Application No. 17906/83 we described the use of probes based on the C peptide region of 
porcine relaxin to select out a relaxin gene from a human genomic library. This approach resulted in the 
35 successful identification of a genomic clone which we have now designated "Hi " from which the structure 
of the entire coding region of a human preprorelaxin was determined. 

The present invention arises from a continuation of the work described in Appiication No. 17906.83 in 
which we sought to confirm the gene structure described in that application. We have investigated cDNA 
clones in libraries derived from ovarian tissue from pregnant human females using as a probe a segment of 
40 the previously identified human H1 gene corresponding to approximately 300 nucleotides of the C 
peptide/A-chain coding region (amino acids 64-161). Positive cDNA clones were isolated and sequencing of 
these revealed a cDNA sequence which was not identical with the sequence previously established and 
which coded for a form of preprorelaxin different to that described in our earlier application. 

We have also isolated from the human genomic library described in our copending Australian Patent 
45 application No. 17906/83 (PF 5352/82) a recombinant phage containing exon i of the H2 gene where exon 
1 comprises the coding region of the signal. B-peptide. and part of the C-peptide similar to that of the Hi - 
gene. 

It is now believed that either or both the presently-described gene which we have designated "H2" and 
the "HI" gene described in our Application No. 17906/83 are expressed in human reproductive tissue, for 
50 example ovary and placenta, and/or other tissues including but not limited to gut brain and skin, since both 
genes express peptides with relaxin-like activity. 

The corpora lutea of the ovary as well as decidual and placental tissues are the most likely sites for 
expression of relaxin-related genes. However, in view of the wide distribution of many peptide hormones it 
is highly likely that the relaxin gene is also expressed in non-reproductive tissues, including brain and the 
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gastrointestinal tract. Relaxin has the general properties of a growth factor and is capable of altering the 
nature of connective tissue and influencing smooth muscle contraction. We believe that one or both of the 
gene structures described in this specification and that of Application No. 17906/83 to be widely distributed 
in the body. We suggest that the relaxin peptides expressed from these genes will play an important 
physiological role in addition to their well documented hormonal function during reproduction. 
The following abbreviations are used In this description, 

HI - the relaxin gene described In Application No. 17906/83. being deduced from a genomic clone. 
H2 - the relaxin gene described herein, being deduced from a cDNA clone. 
DNA - deoxyribonucleic acid 
RNA - ribonucleic acid 

cDNA - complementary DNA (enzymatically synthesized from an mRNA sequence 

mRNA - messenger RNA 

A Adenine 

T- Thymine 

G - Guanine 

C- Cytosine 

U - Uracil 

• The coding relationships between nucleotide sequence in DNA and amino acid sequence in protein are 
collectively known as the genetic code, which is set out below. 



First position 




Second position 


Third position 


(5 end) 










(3' end) 




U 


c 


A 


G 




U 


Phe 


Ser 


Tyr 


Cys 


U 




Phe 


Ser 


Tyr 


Cys 


c 




Leu 


Ser 


Stop 


Stop 


A 




Leu 


Ser 


Stop 


Trp 


G 


C 


Leu 


Pro 


His 


Arg 


U 




Leu 


Pro 


His 


Arg 


G 




Leu 


Pro 


Gin 


Arg 


A 




Leu 


Pro 


Gin 


Arg 


Q 


A 


He 


Thr 


Asn 


Ser 


U 




lie 


Thr 


Asn 


Ser 


C 




lie 


Thr 


Lys 


Arg 


A 




Met 


Thr 


Lys 


Arg 


G 


Q 


VaJ 


Ala 


Asp 


Gly 


U 




Val 


Ala 


Asp 


Gly 


C 




Val 


Ala 


Glu 


Gly 


A 




Vai 


Ala 


Glu 


Gly 


G 



The abbreviations used for the amino acids In the table are Identified as follows. 



Phenylalanine 


(Phe) 


Histidlne 


(His) 


Leucine 


(Leu) 


Glutamine 


(Qln) 


Isoleucine 


(lie) 


Asparagine 


(Asn) 


Methionine 


(Met) 


Lysine 


(Lys) 


Valine 


(Val) 


Aspartic acid 


(Asp) 


Serine 


(Ser) 


Glutamic acid 


(Glu) 


Proline 


(Pro) 


Cysteine 


(Cys) 


Threonine 


(Thr) 


Tryptophan 


(Try) 


Alanine 


(Ala) 


Arginine 


(Arg) 


Tyrosine 


(Tyr) 


Glycine 


(Gly) 
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Each 3-lett0r codon represented in the table, e.g.. AUG, CAU (otherwise known as a deoxynucleoiide 
triplet or nucleotide triplet) corresponds to a trinucleotide of mRNA. having a 5 -end on the left and a 3 -end 
on the right. The letters stand for the purine or pyrimldine bases forming the nucleotide sequence. All DNA 
sequences given herein are those of the strand whose sequence corresponds to the mRNA sequence, with 
5 thymine (T) substituted for uracil (U). 

The invention will be further described and elucidated in the following discussion. Reference will be 
made to the accompanying drawings in which: 

Figure i : shows an abbreviated restriction map and sequencing strategy for the cDNA clone in 
. pBR322, genomic clone H11, and GT10 cDNA clones a-f. Arrows indicate the direction of sequencing on 
10 end-labelled fragments (see methods). QT10 clones a-f were sequenced by subcloning into an M13 vector 
as described hereinafter. Nucleotides are numbered from the AUG initiation codon. position 1-3, to the 
termination codon, position 554-556. 

Figure 2 : compares the amino acid and mRNA sequence of human preprorelaxin H2 (upper) with the 
corresponding HI (lower) sequence. The sequences have been aligned to maximize homology with 
rs nucleotide identities being indicated by asterisics and amino acid homologues by boxed-in areas. Amino 
acidis are numbered from the start of the B-chain (H2 gene sequence starting at -1 and Hi sequence at > 1) 
•although this position represents only the hypothetical start of the B chain sequence and has been deduced 
simply from the homology to the related porcine and rat preprorelaxin structures. The asterisk beneath Ala 
45 In the C peptide denotes the position of an intron in the G/CA codon in both genes. 
20 Figure 3: autoradlographs of identical nitrocellulose strips taken from either a) Northern gel transfer of 

human ovarian RNA or b) Xplaques corresponding to the Hi gene (XH7) or H2 gene (XGTlO-a) using as 
hybridization probes A: a random primed 600 bp H2 relaxin cDNA fragment (72-660), B: H2-specific 25 mer 
(483-507), C: H1 -specific 25 mer (483-507), D: Hi-specific 25 mer (248-272). 

Figure 4 ; autoradlographs of identical nitrocellulose strips following Northern gel transfer of human 
25 oyarian RNA using as hybridization probes fragments of the H2 cDNA clone in pBR322 (see Figure 1). 
A: 600 bp fragment (72-660) corresponding to most of the coding region. 
B: 5' untranslated region (to Hinf I site at nucleotide 30) 
C: 3' untranslated region (from Hinf I site at nucleotide 660) 
0: 3'. untranslated region (from Hpa t site at nucleotide 850) 
30 Figure 5: comparison of the amino acid sequences of the B and A chains between the two human 

relaxin genes, human insulin, and other members of the relaxin family. Boxed areas highlight residue which 
are conserved between the two human relaxin genes and the relaxin family. Arrows indicate probable sites 
of proteolytic cleavage with confirmation by protein sequencing data of the amino terminal residue of the B 
and A chains of porcine (Schwabe et aJ. 1977; James et al, 1977), rat (John et a). 1981). shark {Schwabe et 
35 al, 1982) and dogfish relaxins (Schwabe et aK 1983). 

The H2 mRNA sequence shown in Figure 2 was determined by the methods described hereinafter. For 
ease of comparison, the numbering of the amino acids previously used for the peptide derived from the Hi 
sequence has been maintained in the present description of the H2-derived peptide. The structure of H1- 
preprorelaxtn was deduced from the genomic sequence by comparison with the homologous structures of 
40 pig and rat relaxin. The H2-preprorelaxin structure was deduced by comparison with the Hi structure as 
well as the pig and rat structures. Confiirmation of the A and B peptide chain structures has been provided 
by synthesis and chain recombination In vitro which produces a material which is biologically active in the 
uterine contraction assay. 

It will be seen from Figure 2 that the present and previous sequences show significant differences as 
45 well as similarities. Notable are: 

' (1) Significant amino acid differences in three main areas: 

(a) the N-termlnus of the B-chain 

(b) the N-terminus of the A-chaIn 

(c) the rniddle of the C-peptide. 

50 (2) Regions of strong homology In the B-chain and C-peptide: 

(a) 120 identical bases from Val^ to lle*^ 

(b) 88 - 90 identical bases from Phe^°^ to Ser'32, 

The two genes are. therefore very similar but the differences are sufficient to indicate that the H2- gene 
is tndeed.a second gene and not simply a polymorph of Hi. 
55 The mode of in vitro processing of the HI preprorelaxin is not yet fully known but by analogy with pig 
relaxin, cleavage of the signal peptide would be expected to occur at the Ala^'-Lys' bond. Similarly 
excision of the Hl-C peptide is predicted to occur at Leu^^ -Ser^s and Arg^^e . Arg-^'. thus giving the H1-B 
and HI -A chains of respectively 32 and 24 residues (Figure 2). 
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In H2 preprorelaxin Ala"* has been replaced by Asp and so we would predict cleavage of the signal 
peptide atter the alanine corresponding to position -2 In H1. Cleavage at the H2-B chain^C peptide junction 
is expected after Leu^^ analogy to all other prorelaxins, thus leaving the H2-B chain with 33 residues. 
Cleavage at the H2-C peptide/A chain junction would occur after Arg^^^ by analogy to rat preprorelaxin. thus 
5 leaving the H2-A chain with 24 residues. 

As noted in our studies on pig relaxin. there are core sequences in the pig relaxin B and A chains which 
contain all the essential elements for biological activity. Our synthetic studies on the hunnan relaxin chain 
show similar results, as set out in more detail hereinafter. 

According to one aspect of the present invention, there is provided a gene for the expression of human 
10 H2-preprorelaxjn. 

Except where otherwise specified, all following references to the gene sequences for preprorelaxin. 
prorelaxin. relaxin and the signal, A, B and C peptides, and to the peptides themselves will be understood 
to refer to the H2 varlents and to exclude the HI variants. 

More specifically, this aspect of the Invention provides a double-stranded ONA fragment for the 
75 expression of human preprorelaxin, which comprises a coding strand and a complementary strand 
corresponding to the complete mRNA {codons -25 to 160) sequence shown in Figure 2 of the accompany- 
ing drawings. 

The Invention also includes any sub-unit of the H2-preprorelaxin gene sequence described herein, or 
any equivalent of the said sequence or sub-unit. Among the sub-units to be included by this statement are 
20 the individual structural genes coding for the signal peptide chain and the separate H2-A and H2-B peptides 
and the H2-C chain of human preprorelaxin (see Figure 2) and any combinations of these chains, e.g., the 
genes for expressing the H2-A and H2-B peptides, separately, or as prorelaxin (with the H2-C chain). The 
• sub-units also include fragments and combinations of fragments of any of said gene sequences. 

Thus according to another aspect of the present invention, there is provided a gene for the expression 
25 of human prorelaxin. 

More specifically, this aspect of the invention provides a double-stranded DNA fragment for the 
expression of human prorelaxin. which comprises a coding strand and a complementary strand correspond- 
ing to the codons numbered as 1 to 160 of. the mRNA sequence shown in Figure 2 of the accompanying 
drawings, 

30 According to a further aspect of the present invention, there are provided genes for the separate 
expression of the A, B and C chains of human relaxin or any combination of two or more of the said chains 
and any fragment or combination of fragments of the said chains. 

More specifically, this aspect of the Invention provides double-stranded DNA fragments for the separate 
expression of the A and/or B and/or C chains of human relaxin (or fragments as described above) which 
35 comprise a coding strand and a complementary strand corresponding to the codons numbered -1 to 32, 33 
to 136 and 137 to 160 of the mRNA sequence shown in Figure 2 of the accompanying drawings. 

The genes described above In addition to the codons specified may also include the appropriate "start" 
and -stop". codons. e.g.. AUG and UQA respectively (codons -25 and 161 in Figure 2). 

Those skilled in the art will appreciate that polymorphic forms of the genes may exist. Such forms are 
40 included in the present invention. 

The invention further includes the complements of the above sequences, sub-units or equivalents, and 
the corresponding RNA sequences, sub-units or equivalents. 

According to another aspect of the present invention there is provided a DNA transfer vector comprising 
the deoxynucleotide sequences corresponding to the genes defined above. 
45 As shown above, the genetic code contains redundancies, that is certain amino acids are coded for by 
more than one codon. Thus the invention Includes deoxynucleotide sequences in which the codons 
depicted in the drawings, or their cDNA equivalents are replaced by other codons which code for the same 
amino-acid. 

Furthermore, as already indicated above, peptides with relaxin activity may be produced which differ 
50 from the B and/or A chain structures of natural relaxin. Such differences may involve deletion of one or 
more amino acids and/or addition of further amino acids and/or substitution of different amino acids in the 
natural chains. 

Thus the invention also includes genes and DNA transfer vectors as described above wherein one or 
more of the natural codons are deleted and/or are replaced by codons which code for amino acids other 
55 than that coded by the natural codon. and/or further codons are added to the natural sequence. 

The transfer vectors of the invention may also include inter alia, genetic information which ensures their 
replication when transferred to a host cell. Such cells may include, for example, the cells of procaryotic 
microorganisms, such as bacteria, yeasts and moulds, and also eucaryotic cells, including mammalian cells 
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and ceit lines. 

Examples of transfer vectors commonly used in bacterial genetics are piasmids and the ONA of certain 
bacteriophages. Both phage ONA and bacterial piasmids have been used as the transfer vectors in the 
present work. It will be understood however, that other types of transfer vectors may be employed. The 
s general techniques of forming such transfer vectors and transforming them into microorganisms are well 
known in the art. 

The invention also includes a procaryotic or eucaryotic ceil transformed by any of the transfer vectors 
described above. 

One prefenred microorganism is the very familiar Escherichia coll. but any other suitable microorganism 
70 may be used. 

According to a still further aspect of the present invention, there is provided a process for making a 
ONA transfer vector for use in maintaining and replicating a deoxynucleotide sequence coding for human 
preprorelaxin, characterised by ligating a deoxynucleotide sequence coding for human preprorelaxin with a 
ONA molecule prepared by cleaving a transfer vector with a restriction enzyme. 
75 ONA transfer vectors for use in maintaining and replicating deoxynucleotide sequences coding for 
human prorelaxin and for the A and B chains of human relaxin may be similarly prepared from the 
appropriate deoxynucleotides. 

The A and B peptide chains, and also prorelaxin and preprorelaxin may be prepared by the usual 
process of gene expression, that Is by growing cells containing the appropriate transformed transfer vector 
20 and isolating and purifying the required peptide(s) produced by the cells. 

Thus, the Invention further includes a process for making a fusion protein comprising the amino acid 
sequence of human preprorelaxin as its C-terminal sequence and a portion of a procaryotic or eucaryotic 
protein as its N-terminal sequence, characterised by incubating a ceil culture transformed by an expression 
transfer vector comprising a deoxynucleotide sequence coding for human preprorelaxin, prepared in 
25 accordance with the process described above. 

Fusion proteins comprising the amino acid sequences for human prorelaxin and/or the A and/or B 
and/or C chains of human relaxin may be similarly prepared. 

The fusion peptides products thus produced will be in the form of a fusion protein in which the desired 
peptide is linked with a portion of a procaryotic or eucaryotic protein characteristic of the host cell. Such 
30 fusion proteins also form a part of this invention. 

The Invention also includes a process for synthesizing human prorelaxin comprising the A and B 
peptides separated from each other by a C peptide, characterised by incubating a culture of cells, 
transformed by an expression transfer vector comprising a deoxynucleotide sequence coding for said 
human prorelaxin, prepared as described above, under conditions suitable for expression of said sequence 
35 coding for human prorelaxin, and purifying . human prorelaxin from the lysate or culture medium of said 
cells. 

The peptide of interest can be recovered from the fusion product by any suitable known cleavage 
procedure. 

As already indicated above the transfer vector may be modified by codon substitution /defetion/addrtlon 
40 and such modifications will give rise to modified fusion peptides. In this way appropriate modifications may 
be made to facilitate the cleavage of the fusion peptides, for example, at the junction of B/C or C A chains 
or to modify the peptide chain behaviour during subsequent chemical or biological processing. 
As indicated above, the Invention also provides human relaxin, prorelaxin and preprorelaxin. 
Relaxin may be prepared by direct combination of the separate A and B chains by any of tiie 
45 procedures currently known and used for the preparation of insulin. 

Also in a similar manner to insulin, relaxin may be prepared from prorelaxin by oxidizing or otherwise 
converting the sulfhydryl groups on the A and B peptides of relaxin, prepared as described herein, to form 
disulfide crosslinks between said A and B peptides, and then excising the C peptides, for example, by an 
enzyme-catalyzed hydrolysis specific for the bonds joining the 0 peptide to the A and B peptides. 
50 Accordingly, the present invention further provides a method for the synthesis of human relaxin which 
comprises combining the A and B chains of relaxin (in their full-length, shortened or modified forms) by 
methods known ger se for combination of A and B chains of human insulin. 

One such method comprises reducing a mixture of the S-sulphonated A and B chains and then allowing 
the mixture to oxidize in air. 

55 We have also found that the efficiency of the above procedure is improved when one or both of the A 
and B chains is in the form of an S-thioethy!-cys derivative rather than the S-sulpho form. 

In our Australian patent Application No, 15413/83 (PF 4385/82) we also showed that one or both of the 
A and B chains of relaxin can be shortened at the amino and/or carboxy terminii without significant loss of 
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biological activity and with improved combination yields. These techniques apply equally to the preparation 
of human relaxin. 

Another aspect of the invention provides a human relaxin analogue consisting essentially of shortened 
and/or modified forms of the natural B and/or A peptide chains. 

This aspect of the invention also provides a method for producing a human relaxin analogue which 
comprises the step of forming the shortened and/or modified B and/or A peptide chains and combining 
them by any of the methods described above. 

Our investigations with both pig and human relaxin (H1) show that relaxin activity may be present with 
human A chains as short as A{10-24) and B chains as short as B(l0-22) although the expected practical 
minima are respectively A(4-24) and B(4-23). The peptide A(4-24)-B(1-25) Is already known to have relaxin 
activity. 

In general, for the present relaxin structure (H2) the A chain can be varied from A(i-24) to A(10-24) and 
B chain from B(-1-32) to B(10-22). 

The preferred combinations are derived from: 





A 




B 


any 


(1-24) 


with 


(-1-23) 


of 


(2-24) 


any 


(Up to) 




(3-24) 


of 


(-1-31 ) 



Modifications of the 8 and/or A chains, in accordance with the present invention may involve either 
"genetic" modification, as described above, or chemical modification of the B and/or A chains (in either full- 
length or shortened form) prior to combination by the method of the invention. Two types of modification 
may be employed; either singly or in combination. 

The first type involves the modification of one or more of the amino-acids which occur in the natural or 
shortened B and/or A chains. Such modification will generally involve protection of active groups on one or 
more of the amino-acids by methods Icnown ger se, and the protecting groups may. if desired, be removed 
after combination of the (modified) A and B chains. 

Examples of this type of modification include the acetylation, formylation or similar protecting of free 
amino groups, including the N-terminal, amidation of C-terminal groups, or the formation of esters of 
hydroxyl or carboxylic groups. The formyl group is a typical example of a readily-removable protecting 
group. 

The se6ond type of modification Includes replacement of one or more of the natural amino-acids in the 
B and/or A chains witti a different amino add (including the D-form of a natural amino-acld). This general 
type of modification may also involve the deletion of a natural amino-acid from the chain or the addition of 
one or more extra amino-acids to the chain. 

The purpose of such modifications is to enhance the combination yields of the A and B chains, while 
maintaining the activity of the product, i.e., relaxin or an analogue thereof, or to enhance or modify the 
activity of the product for a given combination yield. Such moditication may extend to the production of 
synthetic analogues which have relaxin-biocking or -antagonistic effects. 

A specific example of the first type of modification is tiie modification of the tryptophan (Trp) residue at 
B2 by addition of a formyl group. 

Examples oif the second type of modification are replacement of the Met moiety at 824 witii norleucine 
(NIe), valine (Val), alanine (Ala), glycine (Gly). serine (Ser) or homoserine (HomoSer). 

The invention In this aspect also includes human relaxin analogues formed from natural or shortened B 
and/or A chains modified in accordance with the invention as described above. 

The A and 6 peptide chains, and also prorelaxin and preprorelaxin may be prepared by the usual 
process of gene expression, that is by growing a microorganism containing the appropriate transformed 
transfer vector and isolating and purifying the required peptide(s) produced by the microorganism. 

The peptide products thus produced may bQ in the form of a fusion protein in which the desired peptide 
is linked with a portion of a procaryotic protein. 

The invention Is further described and illustrated by the following description of the experimental 
procedures used and the results obtained thereby. 

Methods and Materials 
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Messenger RNA isolation and cDNA cloning 

Human ovarian tissue obtained during surgery for the treatnnent of an ectopic pregnancy was quickly 
frozen on dry ice and the RNA isolated in 5M guanidiniunn thiocyanate (Merck) according to the method of 
5 Chirgwin et al., 1979. Poly -A* RNA was converted into double stranded DNA (Wickers et al. 1978) and 
. cloned either by the homopolymeric Q/C tailing method into a pBR322 plasmid vector (Chang et al..'1978) 
or by the lambda packaging method using the XGTIO vector (Huynh et a).. 1983). In our experience the 
efficiency of transformation with the pBR322 method (10* recombinantS/ug of cONA) was far less efficient 
than the lambda technique (up to 10^ recombinantaug of cONA). 



Preparation of hybridization probes 

Radiolabeiled probes were prepared by primed synthesis on various DNA fragments using denatured 
15 random primers of calf thymus DNA (Hudson et al.. 1983, Taylor et al., 1976). The DNA template (100-200 

ng) was denatured with the random primers (1 ug) by boiling in 20 u\ of water for 2 minutes. Synthesis was 

initiated by the addition of a 30 ul reaction mixture containing 50 mM Tris-HCI pH 8.0. 50mM NaCI. imM 

DTT. lOmM MgCl2. 5 units of E. coli DNA Polymerase 1 (Klenow fragment), 500 uM each of dCTP. dGTP. 

dTTP and 0.3 uM a-ppj^^TP (Approx. 3000 Ci/mmol. Amersham). After incubation at 37 *C for 30 
20 minutes the reaction was terminated by dilution into 300 u.1 of a buffer containing 0.3M NaCI, 10mM Tris- 

HCI. pH 8.0. ImM EDTA and passed through a Sephadex*G50 column. (1cm x 5cm) in the same buffer. 

The radiolabeiled probe was collected from the peak fractions at void volume and precipitated with. 2 

volumes of ethanol at -20* C for 2 hours using tRNA (10 ug) as carrier. 

25 

Selection of specific cDNA clones 

To screen the human ovarian cDNA clone bank for relaxin specific sequences we used as a probe a 
segment of the previously identified human Hi gene con'esponding to a 400 nucleotide segment coding for 
30 the C peptide and A-chain from amino acid . 64, through the termination codon and including 80 bases of the 
3' untranslated region. A single positive cDNA clone from the pBR322 library was isolated and sequenced. 
23 unique recombinants were isolated from the XGT10 libraries, but of these only 6 were subjected to 
complete nucleotide sequence analysis. 

DNA sequence analysis 

The sequencing strategy and an abbreviated restriction map of the cDNA clones are summarized in 
Figure 1 . The recombinant plasmid in pBR322 was digested with restriction enzymes Hpa II (P>, Hinf I (F) or 
40 Taq (T) and end-labelled using reverse transcriptase and the appropriate o*iabelled deoxynucleotide 
triphosphate (dCTP for Hpa 11, and Taq I. dATP for Hinf I). Fragments were cleaved internally with a second 
restriction endonuclease and then separated by electrophoresis on 8% polyacrylamide gels prior to 
sequencing by the chemical degradation method of Maxam and Gilbert et al. 1977. 

cDNA clones in XGT10 were sequenced by subcloning Eco R1 resfriction fragments into M13mp9 arid 
45 employing the techniques described by Sanger et al. (1977), 



Southern and Northern gel analyses 

50 Performed on purified genomic DNA after restriction endonuclease cleavage by the method of Southern 
(1975) or on purified RNA. The DNA fragments which were used as probes were found to be specific for 
either exon I or exon II of the H1 genomic clone despite having a small amount of flanking sequences. 
These fragments were generated by subcloning into M13mp8 a 500 bp Alu I fragment of the XH7 clone in 
the case of the exon I probe, or a 400bp Eco Rl-Ava II fragment for the exon II probe, A probe from the H2 

55 cDNA clone was generated by digesting with Hinf I and isolating a 300 bp doublet corresponding to the 
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coding region from Asp 1 to the termination codon and including 110 bases of the 3 untranslated region 
(Figure 1). Oligonucleotide probes were synthesized by the phosphite chemistry method of Beaucage and 
Carulhers (1981) and were end-labelled with y^^P-ATP using T4 polynucleotide kinase. Hybridization 
conditions were calculated on the basis of the Q + C content. 

5 

isolation and nucleotide sequence analysis of the Hg genomic clone 

The human genomic lambda library of Lawn et ai (1978) was screened by method described earlier 
10 (Hudson et al, 1983) except that a mixture of DNA fragments corresponding to axons I and II of the HI 
genomic clone was used for the probe as described above. Positive phage were grown In litre scale liquid 
cultures, the DNA isolated and digested with restriction endonucleases prior to mapping with the exon I and 
II probes. A 4 kilobase EcoRl fragment was found to contain the entire exon I coding region which 
differentiated this clone from the homologous Hi gene structure. This fragment was subcloned Into 
fS M13mp8 and sequenced by the technique of Maxam and Gilbert (1977). After digesting with Ava I, 
fragments spanning the coding region were end-labelled and cleaved internally by a second restriction 
enzyme (Hpa II of Hint I) to generate fragments suitable for sequence analysis. 



^0 Isolation of a cDNA clone 

Samples of human corpus luteum were made available to us as a result of surgical Intervention In 
ectopic pregnancies or from lutectomy at the time of Caesarian section. From the RNA isolated from a 
single corpus luteum a cDNA library was constructed in pBR322 providing about 300 unique recombinants. 
25 Screening this library with an HI -cDNA probe revealed a single recombinant with sequence homology to 
. human relaxin I. To increase the total number of recombinants from such small amounts of ovarian tissue 
we constructed cDNA libraries using the X GT10 cloning system (Huynh et af. 1983), Screening with a 
relaxin-specific probe identified 23 unique cDNA clones of which six where characterized as shown in 
Figure 1. Nucleotide sequence analysis revealed that all 6 cDNA recombinants encoded fragments of the 
30 same relaxin structural gene (Figure 2), yet this sequence was different to the genomic clone reported 
earlier (Hudson et al., 1983). We expected that this novel sequence corresponded to the second human 
relaxin gene (H2) which had been observed in genomic DNA. 

Surprisingly, none of the cDNA clones contained a polyadenosine sequence at the 3 end, although the 
• size of cDNA clones in PBR322 and XGTIO (1800 bp and 1900 bp respectively) indicate that large 
36 transcription products were being synthesized during the cloning procedure. These two cDNA clones had 
overlapping sequence identity at the 3' terminus confirming that they were derived from the same mRNA 
structure. We attributed the loss of the poly-A tail either to premature termination of the double-stranding 
transcription reaction or to excessive SI nuclease degradation during the cloning procedure. 

40 

Isolation of a genomic clone corresponding to the second gene 

A thorough screen of 10^ recombinant phage from the human genomic library of Lawn et. al.. (1978) 
using mixed probes specific for exon I or 11 of the XH7 relaxin clone revealed 16 positive phage. Small scale 

4S restriction mapping analysis revealed that 14 of these recombinant phage corresponded to the Hi relaxin 
gene reported earlier (11 were identical to the XH7 genomic clone; 3 were identical to XH5 a different 
genomic clone of the Hi gene as previously reported by Hudson et. al, 1983). However, the other 2 
recombinant phage were Identical and had a unique restriction pattern characteristic of the H2 relaxin gene 
whose structure is given in Figure 1 . The unusual ratio of recombinants reflects either their proportion in the 

50 original genomic library or results from selective growth during amplification. Souttiern blot analyses of this 
new recombinant phage (XH11) using separate probes con'esponding to either exon I or I) of the XH7 clone,- 
revealed that XH11 contained only the exon I coding region. Attempts to find a full length genomic clone 
corresponding to the H2 relaxin gene either in the library of Lawn et al. (1978) or in another library (Or R. 
Crawford, unpublished) have so far been unsuccessful. 

55 The nucleotide sequence of the relaxin coding region of XH1 1 was found to be identical to that 
obsen/ed in the cONA cipne shown in Figure 2. An intron interrupts the coding; region in exactly the same 
position as' in the XH7 genomic clone (Hudson et. al., 1983) suggesting that these genes arose by a gene 
duplication event at some point in evolution. 
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Northern gel analysis 

RNA was isolated from several samples of human corpora lutea taken from different individuals during 
surgical intervention for ectopic pregnancy or during Caesarian section operations. Northern gel analysis 

5 using probes made from the coding region of either relaxing gene revealed that two major mRNA species of 
approximate sizes 1000 bp and 2000 bp were present in five human ovarian RNA samples tested (Rgure 
3). The smaller mRNA species were 2-3 fold more abundant in the RNA samples tested and this result was 
independent of whether the probe used in the analysis corresponded to Hi or H2 relaxln indicating that high 
cross-hydridizatlon rates occur under our experimental conditions. To differentiate whether these two mRNA 

10 species represent the separate products of the H1 and H2 genes, oligonucleotide probes were synthesized 
over a region of minimum homology (60%) between the two relaxin genes (residues 137-144 in Figure 2), 
These synthetic 25 mers were radiolabelled by kinasing with 7'^^P-ATP and used as hybridization probes 
under conditions shown to provide specificity for either the HI or H2 gene (Figure 3). Northern gel analysis 
using these radiolabelled probes revealed that both mRNA species conresponded to products of the H2 

ra gene. We could not detect any transcription products from the HI gene using the specific probes, although 
low level expression (less than 5% of the H2 level) would have been difficult to Identify. 

To analyse the different mRNA transcripts from the H2 gene, we made specific probes from segments 
of the two large H2 cDNA clones corresponding to the coding region and 5 and 3' untranslated regions 
(Rgure 4). The larger mRNA transcript (approximately 2kb in length) selectively hybridized to segments of 

20 the 3' untranslated region from both cDNA clones, from a position approximately 100 bases from the 
tenmination codon. A potential polyadenylation signal exists in the nucleotide sequence of the cONA clones. 
140 bases from the termination codon, and this region does have homology to the porcine relaxin 
polyadenylation site. However, the question of whether the shorter mRNA product is potyadenylated near 
this position cannot be resolved until full length cONA clones corresponding .to both mRNA forms have 

25 been Isolated and characterised. 

In the absence of the genomic sequence of the H2 gene it is impossible to define the mechanisms 
leading to the formation of the two mRNA transcripts. It is possible, like the collagen and ^-microglobulin 
genes, that cleavage of the primary RNA transcript could occur at alternative polyadenylation sites. On the 
other hand we cannot rule out the possibility of alternative splicing, mechanisms such as occurs in the 

3a calcitonin, growth hormone and a-crystallin genes. 



The primary structure of preprorelaxin encoded by the H2 gene 

35 The mode of in vivo processing of the human preprorelaxin genes is not yet fully understood and has to 
be deduced by analogy to the processing of porcine and rat preprorelaxins (Rgure 5). The predicted B and 
A chain stmctures for the HI and h2 genes have been aligned to other members of the relaxin family and 
human insulin in. Figure 5. 

Cleavage of the signal peptide in HI has been predicted (Hudson et al., 1983) to occur after a short 
40 Side chain residue such as .Ala~\ -2 or -4 or after Ser-6. Cleavage after Ala-1 is consistent with the 
homology to porcine preprorelaxin and human preproinsulin. Similarly, cleavage of the H2 signal peptide 
probably occurs after Ala-2 by such analogy, although cleavage after Ala-4 or Ser-6 are other possibilities, 

By analogy to rat and pig prorelaxtns, cleavage at the 6 chain/C peptide junction would occur after Leu 
32 in both HI and H2 precursors. However, both human relaxin B chains possess at positions 29-30 the 
45 conserved dibasic sequence Lys-Arg, which is a known processing site in other prohormones such as 
proinsulin. and cleavage here cannot be excluded. Direct amino acid sequence analysis of relaxin isolated 
. from corpora lutea of pregnancy will be required to settle this point, in the meantime it seems that the most 
likely structure of the Hi B chain would be 32 residues in length (Lys 1 to Leu 32) and the .H2 B chain 
would be 33 residues (Asp-1 to Leu 32). 
50 Cleavage at the C peptide/A chain junction of HI prorelaxin has been predicted (Hudson et al.. 1983) to 
occur after Arg 136 within a group of 4 basic residues because the Arg-Pro imide bond at 137-138 would 
be resistant to proteolysis. H2 prorelaxin has the same sequence of 4 basic residues and a similar 
. processing step after Arg 136 would result in both the HI and H2 relaxin A chains being 24 residues in 
length. 

55 
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Biological Activity of the H2 gene 

As noted In earlier studies on synthetic pig relaxin peptides, there are core sequences in the pig relaxin 
B and A chains which contain all the essential elements for biological activity. Out synthetic studies on the 
5 H1 relaxin peptides has shown that combination of the complete H1 A chain (Arg 137-CYS 160) to 'a 
shortened form the H1 B chain (Lys 1-Ser 25) produced material which possessed biological activity 
(Hudson et al.. 1983). Further studies on both the Hi and H2 gene structures using peptide synthesis 
reveals that both genes code for forms of relaxin which are biologically active in the rat uterine contractility 
assay. 



Chemical Synthesis of a modified human relaxin H2 (hRLX) A(1-24) - B(-1-24) 



15 

i\) Synthesis of human relaxin A-chain. H2 hRLX A(1-24) 

The amino acid sequence corresponding to residues 1 to 24 of the human relaxin A-chain, deduced as 
described above from the nucleotide sequence of the cDNA clone, was synthesized by the solid-phase 

20 procedure according to the general principles described by Merrifield (e.g. Barany, G. and Men'ifield, R.B. 
In "The Peptides". Ed, E, Gross & J. Meienhofer, Academic Press. N.Y.. pp. 1-284, 1980). 

N-a-tertiarybutyloxycarbonyr-4-methyl-benzyl-L-cysteine ('hereinafter "BOC) was coupled to a 1% 
crosslinked polystyrene resin via the phenylacetamidomethyl (PAM) linkage to a level of 0,30 mmole/gm 
using the method of Tam et al. (Synthesis 12, 955-957, 1979). The BOC-L-CYS-PAM resin (8.0 gm) was 

26 transferred to the reaction vessel of the Beckman Model 990 Peptide Synthesizer and the amino acid 
sequence from residues 23 through to 1 was assembled by the stepwise addition of each suitably protected 
amino acid. The amino terminal BOC protecting group of each amino acid was removed by treatment of the 
resin with 35% Irifiuoroacetlc acid in methylene chloride for 30 minutes followed by neutralization with 5% 
dilsopropylethylamine in methylene chloride for 15 minutes. After each treatment the resin was washed 

30 thoroughly with methylene chloride. The next amino acid in the sequence (suitably protected at the aramino 
with the BOC group and where necessary with the 7side-chaln functional group appropriately protected) 
was coupled to the resin using dicyclohexylcarbodiimide (DCC). The resin was stin'ed with the amino acid 
in methylene chloride for 10 minutes prior to the introduction of the DCC which was also dissolved in 
methylene chloride. A 2,5 molar excess (6.0 mmole) of amino acid and DCC was used for each coupling. 

35 After stirring for 1 hour a sample of the resin was removed from the reaction mixture and tested for the 
presence of free amino groups using the ninhydrin procedure of Kaiser et al. 

(Anal. Biochem., 34, 595-598. 1970). !f the ninhydrin test was negative Indicating complete coupling the 
reaction cycle was continued with BOC deprotection, neutralization and coupling of the next amino acid. For 
a positive ninhydrin test the coupling reaction was repeated with further amino acid and DCC. 

40 Amino acids with side-chain functional groups were used as the following protected derivatives: N-a- 
BOC-2.6-dich}oroben2yl-L-tyrosine, N-o-BOC•^chloroben^yloxycarbonyl•L-lysine; N-o-BOC-L-serine 0-ben- 
zyl ether; N-a-amyloxycarbonyl -N°-tosyl-L-arginine; N-o-BOC-L-threonine 0-benzyl ether: N-orBOC-S-ethyl 
mercapto-L-cysteine (for CYS at A-chain sequence position 15, 11 and 10), 

Following the assembly of the 1-24 peptide' sequence, the final BOC group on the amino terminal 

45 arlglnlne was removed using the deprotectdion neutralization cycle and the peptlde-resin dried In vacuo (wt 
of peptide resin 13.0 gm). A portion of the peptide-resin (2 gm) was treated with anhydrous hydrogen 
fluoride in the presence of anisole (2 ml) at o'C for 30 minutes. The total time for contact of the resln- 
peptide with hydrogen fluoride (HF) was kept to a minimum (not more than 70 minutes) by rapid removal of 
the HF under oil-pump vacuum. The resin-peptide was then washed several times with ethyl acetate to 

50 remove excess anisole, the peptide extracted into 1M acetic acid and the solution lyophllized. The yield of 
crude peptide, (with the cysteines at positions 10, 11 and 15 still protected as the S-thioethyl derivative) 
was 392 mg. Initial purification of the crude peptide was by gel-filtration on Biogel P10 in 0,1 M acetic acid. 
The fractions representing the major peak from this column, which eluted at a position corresponding to a 
molecular weight of approximately 3000, were collected and lyophilized. Amino acid analysis of a sample of 

55 this peptide indicated that all the amino acids of the 1 -24 sequence were present in the correct ratio. 

Further purification of the [S-thioethyl Cys^^-^^'^^l-hRLX A(1-24) peptide was effected by preparative 
reverse-phase HPLC on a Waters C-18 Bondapak column using a o:i% TFA-water/acetonitrile solvent 
system. 
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A sample (80 mg) of the peptide purified by gel-fiitratlon was S-sulfonated with a mixture of sodium 
sulfite and sodium tetrathionate (total reaction time of 3 hours) according to the method described by Du et 
al.. (Scientia Sinica. 101. 84-104 (1961)). The precipitate which formed during the S-sulfonatlon reaction was 
removed by filtration and both the precipitate and the supernatant solution dialyzed against distilled water at 
s 4* C for 48 hours. The contents of the dialysis bags were lypphilized to yield 39.5 mg of peptide from the 
supernatant solution and 20.3 mg of peptide from the precipitate which occurred during the S-sulfonatton 
reaction. A sample of the 'soluble' [S-sulfo Cys^^"'^-^^-^*] hRLX A(l-24) peptide was purified by preparative 
reverse-phase HPLC on a Waters C-18 Bondapak column using a 0.1% TFA-wateracetonitrile solvent 
system. 



liil Synthesis of shortened human relaxin B^chain. H2 hRLX B('1-24) 

The amino acid sequence corresponding to residues -i to 24 of the H2 human relaxin B-chain was 

1$ synthesized using the procedures described above and commencing with 6.0 gm N»a- 
tertlarybutyloxycarbonyl-L-methionlne-O-benzyl-L-serine-phenylacetamldo-methyl polystyrene resin with a 
loading of 0.5 mmole Met per gm. The side-chain protecting groups used in the A-chain synthesis were 
also employed for the 8-chain Including the S-ethyl mercapto derivative for both cysteines at positions lO 
and 22. The glutamic acid residues at positions 4 and 5 and the aspartic acid residue at -i were added as 

20 the N-flfBOC-benzyl ester derivative. The glutamine at position 18 was coupled by the active ester 
procedure using N-o-BOC-L-glutamlne-p-nitrophenyl ester In DMF. Following coupling of the tryptophan at 
position 2. 0.1% Indole was added to the trifluoroacetic acid deprotecting reagent and to the subsequent 
methylene chloride washes. 

The fineJ weight of peptide-resin after removal of the BOC group from the amino terminal aspartic acid 

25 residue and vacuum-drying was 8.5 gm, A portion of the peptide resin (3.5 gm) was treated with anhydrous 
hydrogen fluoride in the presence of anisole (2 ml) at 0* C for 30 minutes and the B-chain peptide isolated 
using the procedure described above for the A-chain. The crude [S-thioethyl Cys^°'^^] hRLX B(-l-24) (0.97 
gm) was purified by gel filtration on BloGel P10 in 1M acetic acid followed by preparative HPLC. 

A sample (100 mg) of the peptide purified by gel filtration was S-sulfonated at pH 8.3 for 3 hours, the 

30 reaction mixture filtered and the precipitate and supernatant solutions dialyzed against distilled water. The 
'soluble' peptide recovered after lyophilization was 42.4 mg; the 'insoluble' peptide was 59.5 mg. The 
sulfonated B-chain peptides were further purified by preparative HPLC using a C-18 reverse-phase column 
and 0.1% TFA-water-acetonitrile solvent system. 

35 

(ill) Chain Combination 

The synthetic H2 hRLX A(1-24) and H2 hRLX B(-1-24) peptides were combined using the procedure 
described by Chance and Hoffmann (Australian Patent Application No. 68844/81) for insulin chains wherein 
40 the S-sulfpnated peptides were mixed in a ratio of A : B of 2.6: 1 at a peptide concentration of i6 mg/ml in 
glycine buffer pH 10.5. Dithiothreitol In glycine buffer was then added in an amount to give a total of 1,0 
sulfhydryl groups for each S-sulfp group. The reaction mixture was then stinred in an open vessel for 24 
hours. 

As a further modification to this procedure we have found the the chain combination reaction to form 
45 biologically active relaxin proceeded efficiently when one or preferably both of the peptide chains are used 
as their S-thioethyl-Cys derivatives rather than in the S-sulfo form specified by Chance and Hoffmann 
(op.cit.) in the case of Insulin. The use of S-thioethyl Cys peptides eliminates a reaction and purification 
step required to convert the peptides to the S-sulfo derivatives. In our experience the S-suifonation reaction 
of relaxin peptides is accompanied by side reactions which render the S-sulfo peptides difficult to purify 
50 resulting in low yields. 

Using the above conditions chain combination yields from 1.5 to 6.0% have been achieved as 
measured by biological activity In the rat uterine contractility assay of Wlqvist & Paul (Acta Endocrinol., 29. 
135-136, 1958). 

56 
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Example of Chain Combination Reaction 

Human relaxm H2 [S-thloethyl Cys^**'^^'^^] ^(1-24) (4.2 mg dry wt., 2.4 mg peptide by amino acid 
analysis. 0.84 umole) was dissolved in 500 ixl of 0.1 M glycine buffer pH 10.5 in a 3 ml stoppered plastic 

5 centrifuge tube, Human relaxin H2 (S-sulfo Cys^^'^M B{-l-24) (l.60mg. 1.60 mg peptide by amino acid 
analysis. 0.33 umole) dissolved in 200 ul of 0.1 M glycine buffer pH 10.5 was added and the mixture 
agitated. An aliquot (23.0 ul, 2.21 umole OTT) of a stock solution of dithithreltol (DTT) made up in 0.1 M 
glycine buffer pH 10.5 (0.96 umole DTT in 10 ul) was added to the peptide solution and following a brief 
agitation the reaction mixture was allowed to stand at 4* C for 24 hours open to the air. The mixture was 

10 then centrifuged and aliquots of the supernatant solution tested for relaxin biological activity in the rat 
uterine contractility assay. Aliquots of the reaction mixture inhibited the spontaneous contractions of the rat 
uterus in a dose-related manner. A 75 ul aliquot completely 'inhibited uterine contractions equivalent to a 
chain combination yield of 5.3% as compared to a native pig relaxin A22 B31 standard, 

J6 

Synthesis of authentic human relaxin H2 : hRLX A(1-24) - B(-1-32) 



20 Synthesis of full length H2 human relaxin B-chain :hRLX B(-1-32) 

The amino acid sequence corresponding to residues -i to +32 of the full length H2 human relaxin B- 
chain was syntheslsed using the procedures- described above and commencing with 6.4 gm N-a- 
tertiarybutyloxy-carbonyl-L-leuclne phenyl acetamidb methyl polystyrene resin with a loading of 0.23 mmol 

26 Leu per gm. The side-chain protecting groups used for the A(l-24) and B(-1 - 24) peptides were also 
employed for the full length B-chain including the S-ethyl mercapto derivative for both cysteines at positions 
10 and 22. A modification of this strategy was the use of the N-formyl derivative of BOC-L-tryptophan for 
coupling at sequence positions 27 and 2. 

The final weight of the peptide-resin following chain assembly was 8.2 gm. A portion of the peptide 

30 resin (4.0gm) was treated with anhydrous hydrogen fluoride-anlsole as described In previous examples, to . 
yield 1,50gm of crude [S-thioethyl Cys^^'^^ N-formyl Trp^"] hRLX B(-1 - 32). The crude peptide was 
purified by gel filtration on BioGel P6 in 0.1 M acetic acid. The major peaks eluting from the gel filtration 
column were characterised by amino acid analysis. The fractions with analyses consistent with the -1 to 
+ 32 peptide sequence were collected and lyophilised. 

35 Deformylation of the tryptophan residues was effected by treating the peptide (I00mg) with sodium 
hydroxide solution (5ml) pH 11.5 for 5 min. during which time the peptide precipitated from solution. The 
reaction mixture was neutralised to dissolve the peptide and applied directly to a BioQel P6 column in 0.1 M 
acetic acid. Removal of the formyl groups from tryptophan was monitored by UV spectroscopy by following 
' the disappearance of the N-formyl absorption at 300 nm and the appearance of the characteristic 

40 tryptophan spectra with an absorption maximum at 280 nm. Peptide fractions eluting from the column with 
the correct amino acid aniysis were collected and lyophilised. 

Attempts to further purity the [S-thioethyl Cys^'^'^a] ^rlX q^.^ .32) peptide by preparative HPLC were 
not successful because of loss of peptide by adsorption to the column media. Peptide purified by gel 
chromatography was used directly in chain combination experiments. 

45 

' ■ \ 

(ii) Chain combination of A(1-24) with B(-1 - 32) : preparation of human relaxin H2 . 

The synthetic S-sulfonated and S-thloethyl H2 human relaxin A(1-24) peptides were coupled to S- 
30 thioethyl H2 human relaxin B(-1 - 32) using the same chain combination procedures described previously 
for the shortened B-chain (-1 - 24), Samples of the recombination mixture were tested for relaxin biological 
activity in the rat uterine contractility assay. Aliquots of the reaction mixture Inhibited the spontaneous 
contractions of the rat uterus in a dose-reiated manner. A lOOul aliquot completely inhibited uterine 
contractions^qulvalent to a chain combination yield of 3.0% as compared to a native pig relaxin A22 B31 
55 . stjandard. 

The features disclosed in the foregoing description. In the following claims and/or in the accompanying 
drawings may. both separately and in any combination thereof, be material for realising the Invention in 
diverse forms thereof, 
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35 . 

Claims 

1. A. human H2-relaxin analogue characterized in that it constists esentially of shortened and/or modified 
40 forms of the natural B and/or A peptide chains. 

2. A human H2-relaxin analogue, characterized in that one or both of A and B chains in the full length or 
shortened form is modified by the addition of a protective group to a free arnino group. 

3. An analogue as claimed in Claim i . characterized in that the modification comprises the replacement 
of at least one of the natural amino-acids in one or both of the A and B chains with a different amino acid. 

45 4. An analogue as claimed in Claim 1. characterized in that the modification comprises the deletion of at 
least one of the natural amino-acids from one or both of the. A and 8 chains and/or the addition of at least 
one extra aml|io-acid to one or both of the natural A and B chains. 

5. An analogue as claimed in Claim 1 , characterized in that the A-chain is shortened by up to 9 amino- 
acids at the amino terminus and/dr the B-chaIn is shortened by up to 9 amino acids at the amino terminus 

so and up to 9 amino-acids at the carboxyl terminus. 

6. An analogue as claimed In Claim 4. characterized in that it consists of any one of the A-chains A(1- 
24). A(2-24). A(3-24) In combination with any one of the B-chains 8(1-23) to 8(1-32). . 

7. A process for preparing human H2-relaxin or an analogue thereof, characterized in that the A and B 
chains of relaxin In their full-length, modified or shortened forms are combined by a method known ger se 

5S for combination of the A and B chains of insulin. 
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8. A process for preparing human H2-relaxin or an analogue thereof, characterized in that the A and B 
chains of relaxin in their full-length, modified or shortened forms are combined by reducing a mixture of the 
S-sulfonated and/or S-thloalkylated A and B chains, allowing the mixture to oxidize in air and recovering the 
relaxin or retaxin analogue thus produced. 
5 9. A process for preparing human H2-relaxin or an analogue from human H2-prorelaxin or an analogue 
thereof, characterized in that the sulfhydryl groups on the A and B peptide chains are converted to disulfide 
cross-links between said A and B peptides and the C-peptide Is then excised. 
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FIG.2A, 


FIG.2B 


FIG.2C. 





Fig. 2 
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-Signal peptide 



Met Pro Arg Leu Phe 
"^AUGCCU CGC CUG UUU 



*:^Tk -k-k^ -k^*^ 

AUG ecu CGC CUG UUC 
Met Pro Arg Leu Phe 



H1 



-20 








Phe 


Phe His Leu Leu 


Gly Vol 


Cys Leu 


UUU 


UUC CAC CUG CUA 


GGAGUC 


UGU UUA 




k-k* k** k** 






UUG 


UUC CAC CUG CUA 


GAA UUC 


UGU UUA 


Leu 


Phe His Leu Leu 


Glu Phe 


Cys Leu 



Met Glu Glu 
AUG GAG GAA 

AAG GAC GAU 
Lys Asp Asp 



10 

Vol He Lys Leu Cys Gly Arg Glu Leu Vol Arg 
GUU AUU AAA UUA UGC GGC CGC GAA UUA GUU CGC 

irk-A k-kk k-k-k kirk kk-k -kick irkk kick -kkk -kkk kkk 

GUU AUU AAA UUA UGC GGC CGC GAA UUA GUU CGC 
Vol Re Lys Leu Cys Gly Arg Glu Leu Vol Arg 



30 

Arg Ser Leu 
AGG UCU CUG 

AGG UCU CUG 
Arg Ser Leu 



C- peptide 



40 



Ser Gin Glu Asp Ala Pro Gin Thr Pro Arg Pro 
AGC CAG GAA GAU GCU CCU CAG ACA CCU AGA CCA 

*** *★* kkk *kk[ kkk kk* k-kir kkk *** *** *** 

AGC CAG GAA GAU GCU CCU CAG ACA CCU AGA CCA 
Ser Gin Glu Asp Ala Pro Gin Thr Pro Arg Pro 



60 



Glu Thr He 


Asn Met 


Met 


Ser 


Glu Phe 


Vol 


GAA ACC AUA 


AAUAUG 


AUG 


UCA 


GAA UUU 


GUU 


*** ** *** 


* ***• 


*** 


* 


*** ** 


** 


GAAACU AUA 


AUU AUG 


AUG 


UUG 


GAA UUC 


AUU 


Glu Thr ile 


Ile He- 


Met 


Leu 


Glu Phe 


Ile 



70 

Ala Asn Leu Pro 
GCU AAUUUGCCA 

k-kk ********* 

GCU AAUUUGCCA 
Ala Asn Leu Pro 



F/G.2A. 
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-10 

Leu Leu Asn Gin Phe Ser Arg Ala Vol Ala 
CUA OJG AAC CAA UUU UCC AGA GCA GUC GCG 
•*** *** ★** *** 

CUA CUG AAC CAA UUU UCC AGA GCA GUC GCG 
Leu Leu Asn Gin Phe Ser Arg Ala Vol Ala 



B-chain 


-1 


1 




Asp 


Ser 


Trp 


GAC 


UCA 


UGG 








GCC 


AAA 


UGG 


Ala 


Lys 


Trp 
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Ala Gin He Ala He Cys Gly Met Ser Thr Trp Ser Lys 
GCG CAG AUU GCC AUU UGC GGC AUG AGC ACC UGG AGC AAA 

*** *** *** *** ****** ****** ****** *** -k-k* *** 

GCG CAG AUU GCC AUU UGC GGC AUG AGC ACC UGG AGC AAA 
Ala Gin He Ala He Cys Gly Met Ser Thr Trp Ser Lys 



50 

Vd Ala Glu He Vol Pro Ser Phe He Asn Lys Asp Thr 
GUG GCA GAA AUU GUG CCA UCC UUC AUC AAC AAAGAU ACA 

•Mrk-kkie *** *** ** *7W! **★ *tWc *** *** *** *** *** 

GUG GCA GAA AUU QUA CCA UCC UUC AUC AAC AAA GAU ACA 
Vol Ala Glu He Val Pro Ser Phe Ue Asn Lys Asp Thr 



Gin 

CAG 
* * 

CCG 
Pro 



Glu Leu Lys 

GAG CUG AAG 
*** *** *** 

GAG CUG 
Glu Leu Lys 



Leu Thr 

UUA ACC 
* ** 

AAG GCA GCC 
Ala Ala 





80 






Leu Ser Glu 


Met 


Gin Pro 


Ala 


CUG UCU GAG 


AUG 


CAG CCA 


GCA 


** *** *** 


* * 


** *** 


** 


CUA UCU GAG 


AGG 


CAA CCA 


UCA 


Leu Ser Glu 


Arg 


Gin Pro 


Ser 



F/G.2B. 
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Leu Pro 

UUA CCA 
*** 

UUA CCA 
Leu Pro 



Glu 



90 

Leu Gin Gin His 
CUACAA CAACAU 



Gin 
CAG 

GAGCUACAG CAG 



Leu Gin Gin 



UAU 
Tyr 



Vol Pro 
GUACCU 

•ki^k kick 

GUACCU 
Vol Pro 



Val 
GUA 

GCA 
AlQ 



Leu Lys Asp Ser 
UUA AAAGAU UCC 

UUA AACGAUUCC 
Leu Lys Asp Ser 



Arg Gin Ser Glu Ala Ala Asp Ser 
AGA CAA AGU GAAGCC GCA GAC AGC 

•k-k ickie *ib»r 

AGG CAA AGU GAAGCC OCA GAC AGC 
Arg Gin Ser Giu Ala Ala Asp Ser 



140 



Ser 
AGU 
* * 

AAU 
Asn 



120 

Pro Ser Glu Leu Lys 
ecu UCA GAAUUAAAA 

ecu UCA GAAUUAAAA 
Pro Ser Glu Leu Lys 

150 



Leu 


Tyr 


Ser 


Ala Leu 


Ala Asn 


Lys Cys Cys 


His Val 


Gly Cys 


cue 


UAC 


AGU 


GCAUUG 


GCUAAU 


AAAUGU UGC 


CAU 6UU 


GGUUGU 


X -k 


•k-kit 




kk-k kic 


* -k 


•kkk ii-kk 




•kk-k ftk-k 


CCC 


UAC 


GUG 


GCACUG 


UUU GAG 


AAA UGU UGC 


CUA AUU 


GGU UGU 


Pro 


Tyr 


Val 


Ala Leu 


Phe Glu 


Lys Cys Cys 


Leu He 


Gly Cys 
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100 



Ser 


Leu 


Leu 


A6U 


CUU 


cue 


* * 




* 


AAU 


CUU 


AGC 


Asn 


Leu 


Ser 



no 

Phe Qu Glu Phe Lys Lys Leu He Arg Asn 
UUUGAAGAAUUUAAG AAACUU AUUCGC AAU 

*T>«yC ********** ****** ****** *** 

UUUGAAGAAUUUAAG AAACUU AUUCGC AAU 
Phe Glu Glu Phe Lys Lys Leu He Arg Asn 



130 



Tyr Leu Gly Leu Asp Thr His Ser 
UAC UUA GGC UUG.GAUACU CAU UCU 
*** *** *** ********* ***• *** 

UAC UUA GGC UUG GAU ACU CAU UCU 
Tyr Leu Gly Leu Asp Thr His Ser 



Thr Lys Arg Ser Leu Ala 

ACC AAA AGA UCU CUU GCU 
******** ********* 

ACC AAAACG UCU CUU GCU 
Thr Lys Arg Ser Leu AIq 



Arg Phe 

AGAUUU 
* * * * 

AAAUAU 
Lys Tyr 





A- 


"fhnin 


Arg 


Lys Lys Arg 


Gin 


CGA 


AAAAAGAGA 


CAA 


* * 


*** ****** 


* * 


CAA 


AAAAAGAGA 


CGA 


Gin 


Lys Lys Arg 


Arg 




160 




Cys 






UGCU6A 




*** 






UGCUGA 




Cys 







F/6.2a 



9/4/2007, EAST Version: 2.1.0.14 



EP 0 303 033 A2 




9/4/2007, EAST version: 2.1.0,14 



EP 0 303 033 A2 




Fig. 4. 
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«/) 

UJ 




Cys 
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Tyr[ 




Phe 
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Leu 


Leu 


Leu 
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Lys 


Arg 


Arg 


Lys 


1 
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Ala 


Ala 


Ala 


Ser 


Ser 
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Leu 
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01 
>— • 
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Gin 


Ser 


Ser 


Asp 


Ser 


Asp 


Asp 


Tyr 


Arg 


Arg 
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Arg 
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Arg 


Arg 
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